Search and retrieval of audiovisual content by integrating non-verbal multimodal, affective, and social descriptors

نویسنده

  • Antonio Camurri
چکیده

One of the research challenges for future search engines concerns the integration of multimodal and cross-modal, nonverbal, full-body, affective, social, and enactive interaction in the process of search and retrieval of audiovisual content. The paper gives a short presentation of the three-year EU project I-SEARCH (EU 7FP ICT STREP), aiming at creating a novel unified framework for multimodal and cross-modal content indexing, sharing, search and retrieval of audiovisual content. A couple of scenarios developing multimodal paradigms of search and retirieval of audiovisual content are introduced and briefly discussed to explain in concrete terms some of the main research challenges that are addressed in I-SEARCH. Finally, the paper presents preliminary results on a specific research challenge: analysis of nonverbal expressive and social behaviour to extract useful information from users for the retrieval of audiovisual content.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Audiovisual integration of emotional signals in voice and face: An event-related fMRI study

In a natural environment, non-verbal emotional communication is multimodal (i.e. speech melody, facial expression) and multifaceted concerning the variety of expressed emotions. Understanding these communicative signals and integrating them into a common percept is paramount to successful social behaviour. While many previous studies have focused on the neurobiology of emotional communication i...

متن کامل

Semantic Encoding and Markup of Georeferenced Documents in Polythematic Digital Libraries of Scientific Literature

The paper considers the principles and basic stages of decomposing georeferenced documents oriented to the problems of markup and semantic search. The paper justifies the necessity to develop a multimodal semiotic system and discusses verbal-visual knowledge representation in digital libraries. To represent knowledge, verbal-visual thesaurus is proposed. The thesaurus includes verbal, verbal-vi...

متن کامل

Using MPEG-7 for Automatic Annotation of Audiovisual Content in eLearning Digital Libraries

In this paper we present a prototype system to enrich audiovisual contents with annotations, which exploits existing technologies for automatic extraction of metadata (such as OCR, speech recognition, cut detection, visual descriptors, etc.). The prototype relies on a metadata model that unifies MPEG-7 and LOM descriptions to edit and enrich audiovisual contents, and it is based on MILOS, a gen...

متن کامل

Multimedia search and retrieval using multimodal annotation propagation and indexing techniques

In this paper, a novel framework for multimodal search and retrieval of rich media objects is presented. The searchable items are media representations consisting of multiple modalities, such as 2D images, 3D objects and audio files, which share a common semantic concept. A manifold learning technique based on Laplacian Eigenmaps was appropriately modified in order to merge the low-level descri...

متن کامل

Predicting Student Performance in Verbal Math Problems Based on Cognitive, Metacognitive, and Affective Factors

Predicting Student Performance in Verbal Math Problems Based on Cognitive, Metacognitive, and Affective Factors F. Karimi, Ph.D. A.R. Moraadi, Ph.D. P. Kadivar, Ph.D. R. Kormi Noori, Ph.D. To determine the predictive role of metacognitive, cognitive, and affective factors in solving verbal math problems, a cluster sample of 450 junior high school  students was given ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010